From keystrokes to annotated process data: Enriching the output of Inputlog with linguistic information
Abstract
Keystroke logging tools are a valuable aid for monitoring written language production. These tools record all keystrokes, including backspaces and deletions, together with timing information. In this paper we report on an extension to the keystroke logging program Inputlog in which we aggregate the logged process data from the keystroke (character) level to the word level. The logged process data are further enriched with different kinds of linguistic information: part-of-speech tags, lemmata, chunk boundaries, syllable boundaries and word frequency. A dedicated parser has been developed that distils word-level revisions, deleted fragments and final product data from the logged process data. The linguistically annotated output will facilitate the linguistic analysis of the logged data and will provide a valuable basis for more linguistically oriented writing process research. The set-up of the extension to Inputlog is largely language-independent. As a proof of concept, the extension has been developed for English and Dutch. Inputlog is freely available for research purposes.
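To make the aggregation step concrete, the following is a minimal sketch of how character-level keystroke events might be replayed into word-level data and deleted fragments. It is not Inputlog's parser or log format: the `Keystroke` record, the `"BACKSPACE"` key name, and the functions `replay` and `to_words` are hypothetical, and the sketch assumes the cursor always sits at the end of the text (no mouse repositioning).

```python
from dataclasses import dataclass


@dataclass
class Keystroke:
    # Hypothetical event record; a real log would carry more fields.
    key: str       # a single character, or the string "BACKSPACE"
    time_ms: int   # time of the key press in milliseconds


def replay(events):
    """Replay keystrokes, assuming the cursor is always at the end of the text.

    Returns the surviving (char, time_ms) pairs and the fragments that were
    removed by runs of consecutive backspaces.
    """
    buffer = []    # characters currently in the text, with their timestamps
    deleted = []   # one string per run of consecutive backspaces
    run = []       # characters removed in the current backspace run
    for ev in events:
        if ev.key == "BACKSPACE":
            if buffer:
                run.append(buffer.pop()[0])
        else:
            if run:
                # Backspacing removes characters right to left, so reverse.
                deleted.append("".join(reversed(run)))
                run = []
            buffer.append((ev.key, ev.time_ms))
    if run:
        deleted.append("".join(reversed(run)))
    return buffer, deleted


def to_words(buffer):
    """Group surviving characters into (word, time of first keystroke) pairs."""
    words = []
    current, start = [], None
    for ch, t in buffer:
        if ch.isspace():
            if current:
                words.append(("".join(current), start))
                current, start = [], None
        else:
            if not current:
                start = t
            current.append(ch)
    if current:
        words.append(("".join(current), start))
    return words


# Usage: type "the cat", delete "cat" with three backspaces, then type "dog".
keys = list("the cat") + ["BACKSPACE"] * 3 + list("dog")
events = [Keystroke(k, i * 100) for i, k in enumerate(keys)]
buffer, deleted = replay(events)
words = to_words(buffer)
```

Word-level records of this kind are the sort of unit to which part-of-speech tags, lemmata or frequency information could then be attached by separate linguistic tools.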
Similar resources
From Character to Word Level: Enabling the Linguistic Analyses of Inputlog Process Data
Keystroke-logging tools are widely used in writing process research. These applications are designed to capture each character and mouse movement as isolated events as an indicator of cognitive processes. The current research project explores the possibilities of aggregating the logged process data from the letter level (keystroke) to the word level by merging them with existing lexica and usin...
EFL Learners' Sensitivity to Linguistic and Discourse Factors in the Process of Anaphoric Resolution
Readers' ability to integrate current information with given information has been considered an important component of the reading comprehension process. One aspect of this integration process involves anaphoric resolution. The purpose of this study is to investigate the process of anaphoric resolution, focusing on inferential rigidity of different types of anaphoric ties. Ninety EFL learner...
Writers on the Move: Visualizing Composing Processes Involved in Academic Writing
The present study aimed to explore the covert processes of editing and revision involved in writing four different academic text genres (i.e. abstract, conclusion, data commentary, and cover letter) in English. To this end, six EFL learners with Persian as their mother tongue were recruited to participate in this study. All the participants attended an induction session and ea...
Enriching Language Data through Projected Structures
This paper explores the potential for annotating and enriching data for minority or endangered languages via the alignment and projection of structure from annotated and parsed data for a resource-rich language such as English. The work presented here draws inspiration from the work of Yarowsky and Ngai (2001), who tested the methods for projecting linguistic annotations from one language to a...
Multiple attribute group decision making with linguistic variables and complete unknown weight information
Interval type-2 fuzzy sets, each of which is characterized by the footprint of uncertainty, are a very useful means to depict the linguistic information in the process of decision making. In this article, we investigate the group decision making problems in which all the linguistic information provided by the decision makers is expressed as interval type-2 fuzzy decision matrices where each of ...
Publication date: 2012